Reinforcement learning

Results: 1147



#Item
171Game theory / Reinforcement learning / Nash equilibrium / Q-learning / Strategy / Partially observable Markov decision process / Action selection / Best response / Bellman equation / Zero-sum game / Agent-based model / Solution concept

Coordination in Multiagent Reinforcement Learning: A Bayesian Approach Georgios Chalkiadakis Craig Boutilier

Add to Reading List

Source URL: www.intelligence.tuc.gr

Language: English - Date: 2009-03-02 16:24:03
172Artificial neural network / Computational neuroscience / Mathematical psychology / Reinforcement learning / Generalization error / Deep learning / Bayesian network / Economic model / Robust statistics / Expected value / Statistics / Machine learning

Stacked Calibration of Off-Policy Policy Evaluation for Video Game Matchmaking Eric Laufer∗ , Raul Chandias Ferrari∗ , Li Yao∗ , Olivier Delalleau† and Yoshua Bengio∗ ∗ Dept. IRO, University of Montreal

Add to Reading List

Source URL: eldar.mathstat.uoguelph.ca

Language: English - Date: 2016-07-12 12:05:04
173Algebraic geometry / Field theory / Valuation / Reinforcement learning / Differential topology

RAAM: The Benefits of Robustness in Approximating Aggregated MDPs in Reinforcement Learning Dharmashankar Subramanian IBM T. J. Watson Research Center Yorktown Heights, NY 10598

Add to Reading List

Source URL: marek.petrik.us

Language: English - Date: 2016-07-14 09:59:52
174Machine learning / Multi-agent systems / Data mining / Formal sciences / Artificial intelligence / Cluster analysis / Reinforcement learning / Agent-based model / Statistical classification / Software agent / Swarm intelligence / K-means clustering

Beyond Reinforcement Learning and Local View in Multiagent Systems Ana L. C. Bazzan KI - Künstliche Intelligenz

Add to Reading List

Source URL: www.inf.ufrgs.br

Language: English - Date: 2014-07-30 07:23:22
175Behaviorism / Reinforcement / Learning / Applied behavior analysis / Response Prompting Procedures

ectacenter-wordmark-print-nospellout

Add to Reading List

Source URL: ectacenter.org

Language: English - Date: 2016-07-18 08:58:50
176Computational neuroscience / Cybernetics / Fellows of the American Association for the Advancement of Science / Richard S. Sutton / Andrew Barto / Reinforcement learning / Artificial neural network / Machine learning / Artificial intelligence / Dalle Molle Institute for Artificial Intelligence Research / Russell Greiner / Timo Honkela

CURRICULUM VITAE Richard S. Sutton April 2015 Professor, Department of Computing Science, University of Alberta address: Athabasca Hall 2-21, University of Alberta, Edmonton, AB T6G 2E8

Add to Reading List

Source URL: webdocs.cs.ualberta.ca

Language: English - Date: 2015-04-07 14:18:17
177Smooth functions / Distribution / Functional analysis / Universal property

GQ(λ): A general gradient algorithm for temporal-difference prediction learning with eligibility traces Hamid Reza Maei and Richard S. Sutton Reinforcement Learning and Artificial Intelligence Laboratory, University of

Add to Reading List

Source URL: webdocs.cs.ualberta.ca

Language: English - Date: 2010-01-22 02:08:08
178Computational neuroscience / Belief revision / Reinforcement learning / Computational statistics / Q-learning / Temporal difference learning / Artificial neural network / Machine learning / Markov decision process / Mathematical optimization / Algorithm / Gradient descent

Sutton, Richard PIN

Add to Reading List

Source URL: webdocs.cs.ualberta.ca

Language: English - Date: 2013-10-18 16:05:54
179Operations research / Dynamic programming / Stochastic control / Markov processes / Reinforcement learning / Markov decision process / Valuation / Algorithm / Mathematical optimization

Learning from Demonstrations: Is It Worth Estimating a Reward Function? Bilal Piot1,2 , Matthieu Geist1 , Olivier Pietquin1,2 1 Supélec, IMS-MaLIS Research group, France

Add to Reading List

Source URL: www.ilhaire.eu

Language: English - Date: 2013-10-03 05:33:46
180Operations research / Dynamic programming / Mathematical optimization / Equations / Decision theory / Reinforcement learning / Markov decision process / Bellman equation / Policy / Partially observable Markov decision process

Journal of Artificial Intelligence Research Submitted 3/13; publishedA Survey of Multi-Objective Sequential Decision-Making Diederik M. Roijers

Add to Reading List

Source URL: arxiv.org

Language: English - Date: 2014-02-04 20:03:22
UPDATE